Notifications for deviations in depiction of different objects in filmed shots of video content

ABSTRACT

A video editing device includes a memory that stores first video content and includes circuitry that extracts a first image frame of a first filmed shot of the first video content and a second image frame of a second filmed shot of the first video content. The first image frame is a last frame of the first filmed shot and the second image frame is a first frame of the second filmed shot. The circuitry detects a first set of objects from the extracted first image frame and a second set of objects from the extracted second image frame. The circuitry further generates and outputs notification information associated with a first object based on a deviation of at least a first visual attribute of the first object in the first image frame from at least a second visual attribute of the first object in the second image frame.

CROSS-REFERENCE TO RELATED APPLICATIONS/INCORPORATION BY REFERENCE

None.

FIELD

Various embodiments of the disclosure relate to video editingtechnologies. More specifically, various embodiments of the disclosurerelate to a video editing device, a method, and a software to notifydeviations in depiction of different objects in filmed shots of videocontent.

BACKGROUND

Recent advancements in the field of video editing have led todevelopment of various techniques that facilitate Non-Linear Editing(NLE) of video content. Traditionally, the video content may be capturedby one or more image capture devices, in numerous production states atdifferent times and/or production locations. The video content (forexample, post-production movie content) includes a plurality of scenesand each of the plurality of scenes includes numerous consecutive shots.Typically, the consecutive shots of the scene may be captured eithercontinuously or after a certain duration (for example after certainhours or days). Various conventional techniques are available throughwhich an editor may perform various editing functions on the capturedshots to generate the scene or the video content. The editor may need tomanually analyze continuity errors associated with depiction of variousobjects (for example, humans or non-living entities) present in multipleshots of one or more scenes. The identification of continuity errors maybe required to avoid a discontinuity or prevent an anomaly in depictionof various objects in between the multiple shots of the one or morescenes. Manual analysis leads to inaccuracy in identification of thecontinuity errors in depiction of the objects. This may further reduceoverall quality of the video content and the editor's credibility amongthe audiences of the video content.

Further limitations and disadvantages of conventional and traditionalapproaches will become apparent to one of skill in the art, throughcomparison of described systems with some aspects of the presentdisclosure, as set forth in the remainder of the present application andwith reference to the drawings.

SUMMARY

A video editing device and a method to control notifications fordeviations in depiction of different objects in filmed shots of videocontent is provided substantially as shown in, and/or described inconnection with, at least one of the figures, as set forth morecompletely in the claims.

These and other features and advantages of the present disclosure may beappreciated from a review of the following detailed description of thepresent disclosure, along with the accompanying figures in which likereference numerals refer to like parts throughout.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 illustrates an exemplary environment for control of notificationsfor deviations in depiction of different objects in filmed shots ofvideo content, in accordance with an embodiment of the disclosure.

FIG. 2 is a block diagram that illustrates an exemplary video editingdevice for control of notifications for deviations in depiction ofdifferent objects in filmed shots of video content, in accordance withan embodiment of the disclosure.

FIG. 3 illustrates a first exemplary scenario for control ofnotifications for deviations in depiction of different objects in filmedshots of video content by the video editing device of FIG. 2, inaccordance with an embodiment of the disclosure.

FIGS. 4A and 4B, collectively, illustrate a user interface to displaynotifications for deviations in depiction of different objects in filmedshots of video content by the video editing device of FIG. 2, inaccordance with an embodiment of the disclosure.

FIG. 5 illustrates a second exemplary scenario for control ofnotifications for deviations in depiction of different objects in filmedshots of video content by the video editing device of FIG. 2, inaccordance with an embodiment of the disclosure.

FIG. 6 illustrates a third exemplary scenario for control ofnotifications for factual errors in depiction of different objects infilmed shots of video content by the video editing device of FIG. 2, inaccordance with an embodiment of the disclosure.

FIG. 7 illustrates a fourth exemplary scenario for control ofnotifications for duplicacy in depiction of different objects in filmedshots of video content by the video editing device of FIG. 2, inaccordance with an embodiment of the disclosure.

FIGS. 8A and 8B, collectively, depict a flowchart that illustratesexemplary operations for control of notifications for deviations indepiction of different objects in filmed shots of video content, inaccordance with an embodiment of the disclosure.

DETAILED DESCRIPTION

The following described implementations may be found in the discloseddevice and method to control notifications for deviations in depictionof different objects in filmed shots of video content. Exemplary aspectsof the disclosure provide a video editing device which automaticallychecks continuity of different objects between multiple filmed shots ofa scene in the video content. The automated continuity check by thevideo editing device ensures consistency between the multiple filmedshots. The video editing device further provides one or morenotifications to an editor of the video content about deviations indepiction of the different objects present between the multiple filmedshots based on the automatic continuity check. Such notifications mayfurther assist the editor to improve overall quality of the videocontent during production and post-production phases of the videocontent.

The video editing device may include a memory configured to store thevideo content (for example a movie) that may include a set of filmedscenes of a filmed sequence. The set of filmed scenes may include aplurality of filmed shots. The plurality of filmed shots of a filmedscene may be consecutive shots captured continuously or after certainduration (for example after certain hours or days) by a video capturingdevice. The plurality of filmed shots in the filmed scene may include aplurality of objects which may be common in the plurality of filmedshots. The plurality of objects may be various living objects (forexample humans) and non-living objects. The disclosed video editingdevice may compare the plurality of objects between the filmed shots todetermine whether the plurality of objects are depicted with similarvisual attributes in the filmed shots. The disclosed video editingdevice may generate a three-dimensional (3D) images of the plurality ofobjects for accurate comparison of the visual attributes. Examples ofthe visual attributes associated with the plurality of objects mayinclude, but are not limited to, shape, a size, a dimension, a color, abrightness value, a contrast value, a texture, an orientation, an angleof tilt, a posture, or textual information. The video editing device mayfurther determine deviation of the plurality of objects between thefilmed shots based on the comparison. The video editing device mayautomatically check the continuity of the plurality of objects betweenthe filmed shots based on the determined deviation. The video editingdevice may further provide notifications about various errors in thecontinuity between the filmed shots to the editor of the video content.The video editing device may also determine factual errors in the filmedshots based on a timeline of the captured video content and respectivestored timelines of the plurality of objects, and further provide thenotification to the editor based on the determined factual errors. Thevideo editing device may also provide notification to the editor basedon the comparison of the plurality of objects in the video content withsimilar objects present in another video content stored in a database.Based on such automated notifications from the video editing device, theeditor may take appropriate decisions either to reshoot the filmed shotsor apply different image processing techniques to maintain continuity ofthe plurality of objects and reduce the detected factual errors. Thisimproves overall quality of the video content finally produced after theediting phase and further enhance overall credibility of the editor toproduce high-quality video content in lesser time.

FIG. 1 illustrates an exemplary environment for control of notificationsfor deviations in depiction of different objects in filmed shots ofvideo content, in accordance with an embodiment of the disclosure. Withreference to FIG. 1, there is shown an network environment 100. Thenetwork environment 100 may include a video editing device 102, a datastorage system 102A, a display screen 104, a server 106, and acommunication network 108. The video editing device 102 may becommunicatively coupled to the server 106, via the communication network108. There is further shown a user 110 associated with the video editingdevice 102.

With reference to FIG. 1, there is further shown a first video content112 which may include a set of filmed scenes 114A to 1148. The set offilmed scenes 114A to 1148 may include a first filmed scene 114A and asecond filmed scene 1148. The first filmed scene 114A may include aplurality of filmed shots 116A to 116C. Similarly, the second filmedscene 1148 may include another plurality of filmed shots 116D to 116F. Afirst filmed shot 116A may include a sequence of image frames 118 and asecond filmed shot 1168 may include a sequence of image frames 120.There is further shown a first image frame 122 of the first filmed shot116A and a second image frame 124 of the second filmed shot 1168. Inaccordance with an embodiment, the first image frame 122 may be a lastframe of the sequence of image frames 118 of the first filmed shot 116Aand the second image frame 124 may a first frame of the sequence ofimage frames 120 of the second filmed shot 1168.

The video editing device 102 may comprise suitable logic, circuitry andinterfaces that may be configured to receive an editing request from theuser 110 (for example an editor) and retrieve the first video content112 from the data storage system 102A based on the received editingrequest. In accordance with an embodiment, the video editing device 102may be configured to detect a first set of objects (not shown) in thefirst image frame 122 and detect a second set of objects (not shown) inthe second image frame 124. The video editing device 102 may be furtherconfigured to check whether a first object in the detected first set ofobjects is absent in the detected second set of objects in the secondimage frame 124. In some embodiments, the video editing device 102 maybe configured to determine a deviation of a first visual attribute ofthe first object in the first image frame 122 from a second visualattribute of the first object in the second image frame 124. Inaccordance with an embodiment, the video editing device 102 may befurther configured to determine continuity or discontinuity of the firstobject between the first image frame 122 and the second image frame 124based on the determined deviation. In some embodiments, the videoediting device 102 may be configured to determine the continuity or thediscontinuity of the first object based on the determination that thefirst object of the first image frame 122 is absent in the second imageframe 124. In accordance with an embodiment, the video editing device102 may be configured to detect factual errors with at least one of thedetected first set of objects and/or at least one of the detected secondset of objects in at least one image frame in the set of filmed scenes114A to 114B. The video editing device 102 may be further configured tooutput notification information for the user 110 based on the determineddiscontinuity or the factual errors. Examples of the video editingdevice 102 may include, but are not limited to, a non-linear editingsystem (NLE), a non-linear video editing system (NLVE), a video editingsystem, a video editing controller, a media production system, acomputer workstation, a mainframe computer, a handheld computer, acellular/mobile phone, a smart appliance, a video player, and/or othercomputing device with image processing capability.

The data storage system 102A may comprise suitable logic, circuitry, andinterfaces that may be configured to store the first video content 112which may include the set of filmed scenes 114A to 114B. In accordancewith an embodiment, the data storage system 102A may be configured tostore first timeline information which indicates an era related to thefirst video content 112. In accordance with an embodiment, the datastorage system 102A may be configured to store metadata associated withthe plurality of filmed shots 116A to 116F. The data storage system 102Amay be configured to store the notification information associated withthe first object for which the video editing device 102 determines thediscontinuity or the factual error. In accordance with an embodiment,the data storage system 102A may be a memory (not shown) or database(not shown) integrated in the video editing device 102.

The display screen 104 may comprise suitable logic, circuitry, andinterfaces that may be configured to display the notificationinformation for the user 110. The display screen 104 may becommunicatively coupled with the video editing device 102, via acommunication channel (not shown). The display screen 104 may berealized through several known technologies such as, but not limited to,at least one of a Liquid Crystal Display (LCD) display, a Light EmittingDiode (LED) display, a plasma display, and an Organic LED (OLED) displaytechnology, and other display. In accordance with an embodiment, thedisplay screen 104 may refer to a display screen of smart-glass device,a see-through display, a projection-based display, an electro-chromicdisplay, and a transparent display. A person with ordinary skill in theart will understand that the scope of the disclosure is not limited toan implementation of the video editing device 102 and the display screen104 as separate entities. In accordance with an embodiment, thefunctionalities of the display screen 104 may be implemented by thevideo editing device 102, without departure from the scope of thedisclosure.

The server 106 may comprise suitable logic, circuitry, and interfacesthat may be configured to store the first video content 112 which mayinclude the set of filmed scenes 114A to 114B. The server 106 may beconfigured to store the first timeline information which indicates theera related to the first video content 112. In accordance with anembodiment, the server 106 may be configured to store second timelineinformation of different objects of different object types. The secondtimeline information may indicate timing information of invention,discovery, use, market release, or evolution of the different objects ofdifferent object types. In accordance with an embodiment, the server 106may be configured to store the metadata associated with the plurality offilmed shots 116A to 116F. The metadata associated with the plurality offilmed shots 116A to 116F may include, but are not limited to,information that may indicate a geo-location at which each of theplurality of filmed shots 116A to 116F are captured, artist information,information about the detected first set of objects, and the detectedsecond set of objects. In accordance with an embodiment, the server 106may be further configured to store a second video content where thesecond video content may include a plurality of filmed shots withassociated metadata.

In accordance with an embodiment, the server 106 may be configured toreceive a content request from the video editing device 102, via thecommunication network 108. The server 106 may be further configured totransmit the stored first video content 112, the first timelineinformation, the second timeline information, the metadata, and thesecond video content to the video editing device 102, via thecommunication network 108, based on the received content request. Inaccordance with an embodiment, the server 106 may be implemented as acloud server, which may be utilized to execute aforementioned operationsof the video editing device 102 through web applications, cloudapplications, HTTP requests, repository operations, file transfer,gaming operations, and the like. Other examples of the server 106 mayinclude, but are not limited to a database server, a file server, a webserver, an application server, a mainframe server, a cloud server, orother types of server.

In accordance with an embodiment, the server 106 may be part of thevideo editing device 102. In accordance with an embodiment, the server106 may be implemented as a plurality of cloud-based resources by use ofseveral technologies that are well known to those skilled in the art.Further, the server 106 may be associated with a single or multipleservice providers. A person with ordinary skill in the art willunderstand that the scope of the disclosure is not limited toimplementation of the server 106 and the video editing device 102 asseparate entities. In accordance with an embodiment, the functionalitiesof the server 106 may be implemented by the video editing device 102,without departure from the scope of the disclosure.

The communication network 108 may include a communication medium throughwhich the video editing device 102 and the server 106 may communicatewith each other. The communication network 108 may be a wired orwireless communication network. Examples of the communication network108 may include, but are not limited to, the Internet, a cloud network,a Wireless Fidelity (Wi-Fi) network, a Personal Area Network (PAN), aLocal Area Network (LAN), or a Metropolitan Area Network (MAN). Variousdevices in the network environment 100 may be configured to connect tothe communication network 108, in accordance with various wired andwireless communication protocols. Examples of such wired and wirelesscommunication protocols may include, but are not limited to, at leastone of a Transmission Control Protocol and Internet Protocol (TCP/IP),User Datagram Protocol (UDP), Hypertext Transfer Protocol (HTTP), FileTransfer Protocol (FTP), Zig Bee, EDGE, IEEE 802.11, light fidelity(Li-Fi), 802.16, IEEE 802.11s, IEEE 802.11g, multi-hop communication,wireless access point (AP), device to device communication, cellularcommunication protocols, and Bluetooth (BT) communication protocols.

In operation, the video editing device 102 may be configured to receivea first user input from the user 110. The first user input may include arequest to perform a continuity check on the first video content 112.Examples of the first video content 112 may include, but are not limitedto, movie content, a song video scene, multimedia content, animationcontent, interactive content and/or a combination thereof. The firstvideo content 112 may be stored in a memory (shown in FIG. 2) of thevideo editing device 102. In accordance with an embodiment, the firstvideo content 112 may include the set of filmed scenes 114A to 114B of afilmed sequence. The first filmed scene 114A may include the pluralityof filmed shots 116A to 116C and the second filmed scene 114B mayinclude another plurality of filmed shots 116D to 116F. The plurality offilmed shots 116A to 116C of the first filmed scene 114A and theplurality of filmed shots 116D to 116F of the second filmed scene 114Bmay be of certain duration (for example in seconds or minutes). Inaccordance with an embodiment, the first filmed shot 116A may includethe sequence of image frames 118 and the second filmed shot 116B mayinclude the sequence of image frames 120. In accordance with anembodiment, a number of image frames in the first filmed shot 116A andthe second filmed shot 1168 may be based on a duration (or time length)of the first filmed shot 116A and the second filmed shot 1168. In someembodiments, the number of image frames may be based on a capture rate(or frame rate) of the video capturing device at the time of capture ofeach of the first filmed shot 116A and the second filmed shot 1168. Thefirst filmed shot 116A and the second filmed shot 1168 may be capturedby the video capturing device continuously over a time period. In someembodiments, the first filmed shot 116A and the second filmed shot 1168may be captured by the video capturing device in a discontinuous manner(for example captured after certain time period say in hours, days orweeks).

In accordance with an embodiment, the first filmed shot 116A and thesecond filmed shot 1168 may further include shot identificationinformation (for example a shot identification number). In accordancewith an embodiment, the video editing device 102 may be configured todetermine the first filmed shot 116A and the second filmed shot 1168 asconsecutive shots of the first filmed scene 114 based on the shotidentification information of each of the first filmed shot 116A and thesecond filmed shot 1168. In some embodiments, the video editing device102 may be configured to determine the first filmed shot 116A and thesecond filmed shot 1168 as the consecutive shots of the first filmedscene 114 based on clapper board information used during the capture ofeach of the first filmed shot 116A and the second filmed shot 1168. Theclapper board information may include, but is not limited to, date-timeinformation of capture, a scene number, a take number, a title of videocontent, or a name of a director of video content. In some embodiments,the video editing device 102 may be configured to receive one or moreuser inputs from the user 110 to select the first filmed shot 116A andthe second filmed shot 1168 as the consecutive shots before the receiptof the first user input to initiate the continuity check on the firstvideo content 112. In some embodiments, the video editing device 102 maybe configured to receive the one or more user inputs from the user 110to arrange a plurality of filmed shots of the set of filmed scenes on anediting timeline (not shown) displayed on the display screen 104 of thevideo editing device 102. The one or more user inputs may includeselection of the first filmed shot 116A and the second filmed shot 1168as the consecutive shots based on the arrangement. In accordance with anembodiment, the video editing device 102 may be configured toautomatically arrange and select the first filmed shot 116A and thesecond filmed shot 1168 as the consecutive shots based on the shotidentification information or the capture timing information.

In accordance with an embodiment, the video editing device 102 may befurther configured to extract the first image frame 122 of the firstfilmed shot 116A and the second image frame 124 of the second filmedshot 1168. In accordance with an embodiment, the first image frame 122may be the last frame of the sequence of image frames 118 of the firstfilmed shot 116A and the second image frame 124 may be the first frameof the sequence of image frames 120 of the second filmed shot 1168. Thevideo editing device 102 may perform the continuity check on the firstimage frame 122 of the first filmed shot 116A and the second image frame124 of the second filmed shot 116B.

In accordance with an embodiment, the video editing device 102 may befurther configured to detect a first set of objects (shown in FIG. 3)from the extracted first image frame 122 and a second set of objects(shown in FIG. 3) from the extracted second image frame 124. Examples ofthe first set of objects and the second set of objects may include, butare not limited to, humans, animals, plants, or other non-livingentities.

In accordance with an embodiment, the video editing device 102 may befurther configured to identify a first object from each of the first setof objects and the second set of objects based on an object type of thefirst object. The object type of the first object in the first set ofobjects and the second set of objects is same. The video editing device102 may be further configured to determine a deviation of a first visualattribute of the first object in the first image frame 122 from a secondvisual attribute of the first object in the second image frame 124. Thefirst visual attribute and the second visual attribute may be of sametype. Examples of the first visual attribute and the second visualattribute may include, but are not limited to, a shape, a size, adimension, a color, a brightness value, a contrast value, a texture, anorientation, an angle of tilt, a posture, or textual information. Thedeviation between the first visual attribute of the first object in thefirst image frame 122 and the second visual attribute of the firstobject in the second image frame 124 may be described in detail, forexample, FIG. 3.

The video editing device 102 may be configured to determine thedeviation to check the continuity of the first object in both the firstimage frame 122 and the second image frame 124. In accordance with anembodiment, the continuity of the first object may be determined basedon the presence of the first object with similar set of visualattributes in both the first image frame 122 and the second image frame124 of the first filmed shot 116A and the second filmed shot 1168respectively.

In accordance with an embodiment, the video editing device 102 may befurther configured to generate notification information associated withthe first object based on the determined deviation. In accordance withan embodiment, the video editing device 102 may be further configured tocontrol the display screen 104 to output the generated notificationinformation associated with the first object in the first image frame122 and the second image frame 124. The output notification informationmay be described in detail, for example in FIGS. 4A and 4B. Thus, thenotification information about the discontinuity of the first objectbetween the first image frame 122 and the second image frame 124 mayallow the user 110 (for example the editor) to take appropriatedecisions either to reshoot the first filmed shot 116A and the secondfilmed shot 116B or to apply one or more known image processingtechniques on the first object to maintain the continuity of the firstobject between the first filmed shot 116A and the second filmed shot116B. The user 110 may also take decisions to discard the notificationinformation in case the discontinuity of the first object is intentionalat the time of capture of the first filmed shot 116A and second filmedshot 116B.

In accordance with an embodiment, the video editing device 102 may beconfigured to perform the continuity check on different consecutive oradjoining filmed shots of each of the set of filmed scenes 114A to 114Bof the first video content 112 and generate the notification informationfor different objects based on the performed continuity check. Thus, thevideo editing device 102 may provide accurate and automatic detection ofcontinuity errors (or mistakes) in the captured first video content 112which further reduces a substantial manual effort of the user 110. Theaccurate and automatic detection of the continuity errors may furtherassist the user 110 to improve overall quality of the first videocontent 112 and enhance the credibility of the user 110 among theaudiences of the first video content 112.

FIG. 2 is a block diagram that illustrates an exemplary video editingdevice for control of notifications for deviations in depiction ofdifferent objects in filmed shots of video content, in accordance withan embodiment of the disclosure. FIG. 2 is explained in conjunction withelements from FIG. 1. With reference to FIG. 2, there is shown the videoediting device 102. The video editing device 102 may include circuitry202 which may further include the object detector 204. The video editingdevice 102 may further include a memory 206, network interface 208, andan input/output (I/O) device 210. The I/O device 210 may include thedisplay screen 104. The circuitry 202 may be communicatively coupled tothe memory 206, the network interface 208, and the I/O device 210. Thecircuitry 202 may be configured to communicate with the server 106, byuse of the network interface 208.

The circuitry 202 may comprise suitable logic, circuitry, and interfacesthat may be configured to execute a set of instructions stored in thememory 206. The circuitry 202 may be configured to receive the firstuser input (i.e. to perform the continuity check) from the user 110 viathe I/O device 210. The circuitry 202 may be further configured todetermine the continuity of different objects included in theconsecutive filmed shots in the filmed scene based on the received firstuser input. In accordance with an embodiment, the circuitry 202 may beconfigured to analyze different objects in the first video content 112to determine the factual errors. The circuitry 202 may be implementedbased on a number of processor technologies known in the art. Examplesof the circuitry 202 may be a Graphical Processing Unit (GPU), a CentralProcessing Unit (CPU), X86-based processor, a Reduced Instruction SetComputing (RISC) processor, an Application-Specific Integrated Circuit(ASIC) processor, a Complex Instruction Set Computing (CISC) processor,other processors, and the like.

The object detector 204 may comprise suitable logic, circuitry, andinterfaces that may be configured to detect the first set of objects andthe second set of objects from the first image frame 122 of the firstfilmed shot 116A and the second image frame 124 of the second filmedshot 1168 respectively. In accordance with an embodiment, the objectdetector 204 may be further configured to identify different objecttypes of the detected the first set of objects and the second set ofobjects. The detected first set of objects and the second set of objectsmay correspond to at least one of a two-dimensional (2D) object or athree-dimensional (3D) object. In accordance with an embodiment, theobject detector 204 may be configured to detect the first set ofobjects, the second set of objects, and related object types based onobject detection and classification technique. The object detection andclassification technique may be based on, but not limited to, numericalcomputation techniques using data flow graphs, a deep neural network(DNN) architecture, such as a convolutional neural network (CNN), aCNN-recurrent neural network (CNN-RNN), R-CNN, Fast R-CNN, Faster R-CNN,and (You Only Look Once) YOLO network. In some embodiments, the objectdetector 204 may implement other object identification techniques, suchas human shape-based object boundary identification, pre-specifiedshape-based or template based object identification, such as by use ofSobel operator or Prewitt operator. Examples of implementations of theobject detector 204 may be a specialized circuitry, a GPU, a RISCprocessor, an ASIC processor, a CISC processor, a microcontroller, acentral processing unit (CPU), or other control circuits.

In certain embodiments, the object detector 204 may be implemented basedon a DNN learning model trained for detection and identification ofobjects in image frames. In such implementations, the object detector204 may be a specialized DNN circuitry and/or may implement asupplementary accelerator circuitry, to boost training and/or speed ofdetection of objects in image frames captured by the video capturingdevice. The accelerator circuitry may be an on-device (offline)accelerator circuitry (not shown in FIG. 1) or a server-end (online)accelerator circuitry (i.e., available on a cloud server).

The memory 206 may comprise suitable logic, circuitry, and interfacesthat may be configured to store the first video content 112, the firsttimeline information which may indicate the era related to the firstvideo content 112. In accordance with an embodiment, the memory 206 maybe configured to store the notification information associated the firstobject for which the video editing device 102 determines thediscontinuity or the factual error. In some embodiments, the memory 206may perform functionalities similar to the data storage system 102Ashown in FIG. 1. Examples of implementation of the memory 206 mayinclude, but are not limited to, Random Access Memory (RAM), Read OnlyMemory (ROM), Electrically Erasable Programmable Read-Only Memory(EEPROM), Hard Disk Drive (HDD), a Solid-State Drive (SSD), a CPU cache,and/or a Secure Digital (SD) card.

The network interface 208 may comprise suitable logic, circuitry, and/orinterfaces that may be configured to facilitate communication betweenthe video editing device 102 and the server 106, via the communicationnetwork 108. The network interface 208 may be implemented by use ofvarious known technologies to support wired or wireless communication ofthe video editing device 102 with the communication network 108. Thenetwork interface 208 may include, but is not limited to, an antenna, aradio frequency (RF) transceiver, one or more amplifiers, a tuner, oneor more oscillators, a digital signal processor, a coder-decoder (CODEC)chipset, a subscriber identity module (SIM) card, or a local buffer.

The I/O device 210 may comprise suitable logic, circuitry, and/orinterfaces that may be configured to act as an I/O channel/interfacebetween the user 110 and the different operational components of thevideo editing device 102. The I/O device 110 may be configured toreceive the first user input from the user 110 to perform the continuitycheck on the first video content 112. The I/O device 110 may be furtherconfigured to display the notification information associated the firstobject for which the video editing device 102 determines thediscontinuity or the factual error. The I/O device 210 may comprisevarious input and output devices, which may be configured to communicatewith different operational components of the device 102. Examples of theI/O device 210 may include, but are not limited to, a touch screen, akeyboard, a mouse, a joystick, a microphone, and a display screen (forexample the display screen 104).

The functions or operations executed by the video editing device 102, asdescribed in FIG. 1, may be performed by the circuitry 202 and theobject detector 204. The operations executed by the circuitry 202 andthe object detector 204 are further described, for example, in the FIGS.3, 4A, 4B, 5, 6 and 7.

FIG. 3 illustrates a first exemplary scenario for control ofnotifications for deviations in depiction of different objects in filmedshots of video content by the video editing device of FIG. 2, inaccordance with an embodiment of the disclosure. FIG. 3 is explained inconjunction with elements from FIG. 1 and FIG. 2. With reference to FIG.3, there is shown a first scenario 300.

In the first scenario 300, there is shown the first image frame 122 ofthe first filmed shot 116A and the second image frame 124 of the secondfilmed shot 1168. In accordance with an embodiment, the first filmedshot 116A and the second filmed shot 1168 may correspond to same filmedscene (for example the first filmed scene 114A). The first filmed shot116A and the second filmed shot 1168 may be the consecutive or adjoiningshots of the first filmed scene 114A. In accordance with an embodiment,the first image frame 122 may be the last frame of the sequence of imageframes 118 of the first filmed shot 116A and the second image frame 124may be the first frame the sequence of image frames 120 of the secondfilmed shot 1168. In accordance with an embodiment, the first filmedshot 116A and the second filmed shot 116B may correspond to differentfilmed scenes of the first video content 112, where the different filmedscene may correspond to same context. For example, different scenes arecaptured at different time (or with different scripts or dialogues) butwith similar objects in background as the same context.

In FIG. 3, there is further shown the video editing device 102 which isconfigured to perform the continuity check on the first image frame 122and the second image frame 124 of the consecutive shots (the firstfilmed shot 116A and the second filmed shot 116B). The video editingdevice 102 may be configured to perform the continuity check to verifythe consistency (i.e. presence or visual attributes) of differentobjects between the first filmed shot 116A and the second filmed shot1168.

The circuitry 202 may be configured to initiate the continuity checkbetween each consecutive shots of the set of filmed scenes 114A to 114Bbased on the receipt of the first user input from the user 110 (forexample the editor of the first video content 112). In the continuitycheck, the circuitry 202 may be configured to extract the first imageframe 122 of the first filmed shot 116A and the second image frame 124of the second filmed shot 1168. In accordance with an embodiment, forthe extraction, the circuitry 202 may be configured to read pixelinformation associated with the first image frame 122 and the secondimage frame 124 from the memory 206 in which the first video content 112is stored. In accordance with an embodiment, the circuitry 202 may befurther configured to provide the extracted first image frame 122 andthe extracted second image frame 124 to the object detector 204.

In accordance with an embodiment, the object detector 204 may be furtherconfigured to detect a first set of objects (such as 302, 304A, 306A,308A, 310A, 312A, and 314A in FIG. 3) from the extracted first imageframe 122. The object detector 204 may be further configured to detect asecond set of objects (such as 304B, 306B, 308B, 310B, 312B, and 314B inFIG. 3) from the extracted second image frame 124. The object detector204 may be further configured to classify the detected objects intodifferent object types based on different object detection techniques asdescribed in FIG. 1. With respect to FIG. 3, the object detector 204 maybe configured to detect and classify a first object 304A with the objecttype as a human, a second object 306A with the object type as a dog, athird object 308A with the object type as a car, a fourth object 310Awith the object type as a lighting device, a fifth object 314A with theobject type as a box, and a sixth object 302 with the object type as aplant from the first set of objects in the first image frame 122. Theobject detector 204 may be further configured to detect and classify aseventh object 304B with the object type as the human, an eight object306B with the object type as the dog, a ninth object 308B with theobject type as the car, a tenth object 3108 with the object type as thelighting device, and an eleventh object 314B with the object type as thebox from the second set of objects in the second image frame 124. Theobject detector 204 may be configured to determine pixel coordinates ofeach of the first set of objects and the second set of objects in thefirst image frame 122 and the second image frame 124 respectively. Thepixel coordinates may be in X-axis and Y-axis based on a resolution atwhich the first image frame 122 and the second image frame 124 arecaptured by the video capturing device.

In accordance with an embodiment, the circuitry 202 may be configured toreceive the detected object type and the determined pixel coordinates ofeach of the first set of objects and the second set of objects from theobject detector 204. The circuitry 202 may be further configured toidentify the first object 304A in the first image frame 122 and theseventh object 304B in the second image frame 124 as same objects basedon the object type and the determined pixel coordinates of both thefirst object 304A and the seventh object 304B. In accordance with anembodiment, the object detector 204 may be configured to detect andclassify different sub-objects within detected objects. For example, theobject detector 204 may be configured to detect a twelfth object 314 asinsignia within the detected first object 304A (as human) in the firstimage frame 122. In accordance with an embodiment, the circuitry 202 maybe configured to receive the twelfth object 314 in the first set ofobjects from the object detector 204. The circuitry 202 may beconfigured to determine that the twelfth object 314 (for exampleinsignia) in the first set of objects is missing or absent in thedetected second set of objects based on the object type and the pixelcoordinates of the twelfth object 314 in the first image frame 122. Thecircuitry 202 may be further configured to generate the notificationinformation for the user 110. The notification information may indicatethe discontinuity or inconsistency of the twelfth object 314 between thefirst image frame 122 (for example last frame of the first filmed shot116A) and the second image frame 124 (for example first frame of thesecond filmed shot 116B). The notification information may be an alertor an error message about the discontinuity of the twelfth object 314(for example insignia) for the user 110. The circuitry 202 may beconfigured to control the display screen 104 to output the generatednotification information associated with the twelfth object 314. Thedetails of the notification information may be described in detail, forexample, in FIGS. 4A and 4B. In accordance with an embodiment, thecircuitry 202 may be configured to control a speaker (not shown)associated with the video editing device 102 to output the generatednotification in an audio form.

In accordance with an embodiment, the circuitry 202 may be furtherconfigured to determine one or more visual attributes of the detectedfirst set of objects and the second set of objects. In some embodiments,the circuitry 202 may be configured to determine the one or more visualattributes based on the pixel information of each pixel in the firstimage frame 122 and the second image frame 124. The pixel information ofmay include, but are not limited to, a brightness value, a contrastvalue, or a color value. In some embodiments, the circuitry 202 may beconfigured to determine the one or more visual attributes based on thedetermined pixel coordinates of each of the first set of objects and thesecond set of objects. Examples of the one or more visual attributes ofthe detected first set of objects and the second set of objects mayinclude, but are not limited to, a shape, a size, a dimension, a color,a brightness value, a contrast value, a texture, an orientation, anangle of tilt, a posture, or textual information.

In accordance with an embodiment, the circuitry 202 may be furtherconfigured to determine the one or more visual attributes for each ofthe first object 304A and the seventh object 304B (i.e. same objects).The circuitry 202 may be configured to determine a deviation in at leastone visual attribute of the determined one or more visual attributes foreach of the first object 304A and the seventh object 304B. With respectto FIG. 3, the one or more visual attributes of the first object 304Aand the seventh object 304B are same. Thus, the circuitry 202 may beconfigured to detect the consistency (or continuity) in the one or morevisual attributes of the first object 304A and the seventh object 304Bbetween the first image frame 122 and the second image frame 124.

The circuitry 202 may be further configured to identify the secondobject 306A in the first image frame 122 and the eight object 306B inthe second image frame 124 as same objects (for example dog) based onthe object type and the determined pixel coordinates of both the secondobject 306A and the eight object 306B. The circuitry 202 may be furtherconfigured to determine the one or more visual attributes of the secondobject 306A and the eight object 306B. With respect to FIG. 3, thecircuitry 202 may determine the deviation in size between the secondobject 306A and the eight object 306B. The circuitry 202 may be furtherconfigured to generate the notification information associated with thesecond object 306A (or the eight object 306B) based on the deviation insize. The deviation in size between the second object 306A and the eightobject 306B may indicate the discontinuity or inconsistency between thefirst image frame 122 of the first filmed shot 116A and the second imageframe 124 of the second filmed shot 1168.

Similarly, with respect to FIG. 3, the circuitry 202 may determine adeviation in shape between the third object 308A (as car) in the firstimage frame 122 and the ninth object 308B (similar to the third object308A) in the second image frame 124. The circuitry 202 may be furtherconfigured to determine a deviation in color between the fourth object310A (as the lighting device) in the first image frame 122 and the tenthobject 3108 in the second image frame 124.

In accordance with an embodiment, the object detector 204 may beconfigured to detect textual information in the detected first set ofobjects and the second set of objects. In accordance with an embodiment,the object detector 204 may detect the textual information based on oneor more text recognition or optical character recognition techniques.The object detector 204 may be further configured to provide thedetected textual information in each of the first set of objects and thesecond set of objects to the circuitry 202. The circuitry 202 may befurther configured to compare the textual information of similar objectsin the first image frame 122 and the second image frame 124 to determinea deviation in the textual information. With respect to FIG. 3, thecircuitry 202 may determine a deviation between first textualinformation 312A (for example “COP1017X”) in the third object 308A andsecond textual information 312B (for example “COP1022Y”) in the ninthobject 308B. The circuitry 202 may be further configured to generate thenotification information associated with the third object 308A (or theninth object 308B) based on the determined deviation between the firsttextual information 312A and the second textual information 3128.Further, with respect to FIG. 3, the circuitry 202 may determine thatthe sixth object 302 (as plant) in the first set of objects is absent ormissing in the second set of objects based on the detected object typeof each of the first set of objects and the second set of objects. Thecircuitry 202 may be configured to generate the notification informationto indicate the discontinuity of the sixth object 302 between the firstimage frame 122 and the second image frame 124. Thus, the video editingdevice 102 may provide automatic and accurate generation of thenotification information for the discontinuities (either in presence orin at least one visual attributes) of the objects between theconsecutive shots. The automatic and accurate generation of thenotification information ensures considerable time saving in theidentification of the discontinuities (or the continuity errors) andfurther ensures improvement in the overall quality of the first videocontent 112.

In accordance with an embodiment, the circuitry 202 may be furtherconfigured to determine a position deviation of each of the first set ofobjects in the first image frame 122 with respect to similar object inthe second set of objects in the second image frame 124. The circuitry202 may be configured to determine the position deviation based on thepixel coordinates of each of the first set of objects and the second setof objects. With respect to FIG. 3, the circuitry 202 may be configuredto identify the position deviation between the fifth object 314A (asbox) in the first image frame 122 and the eleventh object 314B (similarto the fifth object 314A). The position deviation between the fifthobject 314A and the eleventh object 314B may indicate a change inposition or movement of the fifth object 314A (or the eleventh object314B) between the first filmed shot 116A and the second filmed shot116B. In accordance with an embodiment, the circuitry 202 may beconfigured to generate the notification information about the fifthobject 314A (or the eleventh object 314B) based on the detected positiondeviation. The circuitry 202 may be further configured to control thedisplay screen 104 to display the notification information about theposition deviation to the user 110. Based on the notificationinformation about the position deviation, the user 110 may take decisionto reshoot the second filmed shot 116B by correction of the position ofthe eleventh object 314B and further avoid position discontinuitybetween the fifth object 314A and the eleventh object 314B. The user 110may use different image processing or image editing techniques tocorrect the position of the eleventh object 314B in the second imageframe 124. The user 110 may also ignore the notification informationabout the fifth object 314A (or the eleventh object 314B) in case, theposition deviation between the fifth object 314A and the eleventh object314B was intentional at the time of capture of the second filmed shot116B. For example, a director or a camera man may change a position orlocation of a particular object between the consecutive shotsintentionally based on a script of the first filmed scene 114A while thecapture of the first video content 112. In another example, in case thedirector or the camera man changes a zoom of the video capturing deviceat the time of capture of the second filmed shot 116B with respect tothe first filmed shot 116A, then the deviation in the one or more visualattributes (for example size or dimension) may be considered intentionalby the user 110.

In accordance with an embodiment, the circuitry 202 may be configured todetermine the position deviation between each of the first set ofobjects and the corresponding similar objects of the second set ofobjects based on the context of the first filmed scene 114A (i.e. whichincludes the first filmed shot 116A and the second filmed shot 116B).The circuitry 202 may be configured to analyze a plurality of imageframes previous to the first image frame 122 (or a plurality of filmedshots previous to the first filmed shot 116A) to determine the contextof the first filmed scene 114A. In accordance with an embodiment, thecircuitry 202 may be configured to determine the pixel coordinates ofdifferent objects in the plurality of image frames to determine thecontext of the first filmed scene 114A. For example, in case of a movingscene (such as a dance or a song scene as context) where frequentmovement in the objects and the sub-objects are captured between theconsecutive shots, the circuitry 202 may be configured to discard theposition deviation (between the first set of objects and thecorresponding similar objects of the second set of objects) to generatethe notification information for the user 110. Thus, the video editingdevice 102 may provide the intelligent continuity check of the objectsbetween the consecutive shots. Such intelligent continuity checkperformed by the video editing device 102 may provide accuratecontinuity errors to the user 110 in lesser time. The user 110 mayfurther analyze and correct all the continuity errors to improve thequality of the first video content 112 finally released after editingphase.

FIGS. 4A and 4B, collectively, illustrate a user interface to displaynotifications for deviations in depiction of different objects in filmedshots of video content by the video editing device of FIG. 2, inaccordance with an embodiment of the disclosure. FIGS. 4A and 4B areexplained in conjunction with elements from FIGS. 1, 2 and 3. Withreference to FIG. 4A, there is shown a first user interface (UI) 400Adisplayed on the display screen 104.

In accordance with an embodiment, the first user interface (UI) 400A mayindicate the notification information generated based on the continuitycheck executed by the circuitry 202 on the consecutive shots of the setof filmed scenes 114A to 114B of the first video content 112. Withrespect to FIG. 4A, the first user interface 400A may indicate thenotification information generated based on the continuity checkexecuted between the first set of objects and the second set of objectsin the first image frame 122 and the second image frame 124respectively, as described in detail, for example, in FIG. 3. Thenotification information may indicate the discontinuity or theinconsistency between the first set of objects and the second set ofobjects in the first image frame 122 and the second image frame 124. Inaccordance with an embodiment, the notification information may includeidentifier of the first filmed scene 114A in which one or morediscontinuities of objects are detected by the video editing device 102.The identifier of the first filmed scene 114A may indicate a scenenumber in the first video content 112. In some embodiments, thenotification information may further include identification informationof the first filmed shot 116A and the second filmed shot 116B betweenwhich the one or more discontinuities of objects are detected by thevideo editing device 102. The identification information may indicate ashot number of the first filmed scene 114A which may include the firstfilmed shot 116A and the second filmed shot 116B. In accordance with anembodiment, the notification information may include details of thediscontinuities (or the continuity errors). The details of thediscontinuities may include the object type and the deviation in the oneor more visual attributes of the first set of objects and the second setof objects.

With respect to FIG. 4A, there is shown a plurality of UI options 402Ato 402N which may indicate the generated notification informationdisplayed in a tabular form on the display screen 104. Each of theplurality of UI options 402A to 402N may display the generatednotification information. In accordance with an embodiment, each of theplurality of UI options 402A to 402N may indicate the identifier of thefirst filmed scene 114A, the identification information of the firstfilmed shot 116A and the second filmed shot 1168, and the details of thedetected discontinuities.

In accordance with an embodiment, the plurality of UI options 402A to402N may include a first UI option 402A which may indicate thediscontinuity about the twelfth object 314 (for example insignia)between the first image frame 122 and the second image frame 124 asdescribed in FIG. 3. For example, the first UI option 402A may indicatean error message which may include the details of the continuity errorswith the twelfth object 314 as “Missing Insignia on Uniform” in a filmedscene number “1” (as the identifier of the first filmed scene 114A) andin between the consecutive shots with number 1 and 2 (as theidentification information of the first filmed shot 116A and the secondfilmed shot 116B).

Similarly, the plurality of UI options 402A to 402N may include a secondUI option 402B which may include the error message. The error messagemay indicate the discontinuity (i.e. deviation in size) associated withthe second object 306A and the eight object 306B (with the object typeas dog) as described in FIG. 3. The error message may also indicate aproduction error in depiction of the eight object 306B in the secondimage frame 124. For example, the second UI option 402B may indicate thedetails of the continuity error of the eight object 306B as “Size of dogis different” with the identifier of the first filmed scene 114A as “1’and the identification information of the first filmed shot 116A and thesecond filmed shot 1168 as “1 and 2”. With respect to FIG. 4A, theplurality of UI options 402A to 402N may include a third UI option 402C,a fourth UI option 402D, a fifth UI option 402E, and a sixth UI option402F. In accordance with an embodiment, the third UI option 402C mayinclude the error message that may indicate the discontinuity (i.e.deviation in shape and textual information) associated with the thirdobject 308A and the ninth object 308B (as car) between the first imageframe 122 and the second image frame 124. For example, the third UIoption 402C may indicate the details of the discontinuity of the thirdobject 308A and the ninth object 308B as “Shape and text of car aredifferent”. The fourth UI option 402D may include the error message thatmay indicate the discontinuity (i.e. deviation in color) associated withthe fourth object 310A and the tenth object 310B (as the lightingdevice). For example, the fourth UI option 402D may indicate the detailsof the discontinuity of the fourth object 310A and the tenth object 310Bas “Color of lighting device over car is different”. The fifth UI option402E may indicate the details of the continuity error of the sixthobject 302 as “Missing plant” with the filmed scene number as 1″ and theconsecutive shots as “1, 2”. Further, the sixth UI option 402F mayinclude the error message that may indicate the discontinuity (i.e.deviation in position) associated with the fifth object 314A and theeleventh object 314B (as the box). For example, the sixth UI option 402Fmay indicate the details of the discontinuity of the fifth object 314Aand the eleventh object 314B as “Position of box is different”.

In accordance with an embodiment, the circuitry 202 may be configured toadd the notification information as metadata in the first video content112. In some embodiments, the circuitry 202 may be configured to add thenotification information as the metadata with the first filmed shot 116Aor the second filmed shot 1168 which includes the inconsistent objectsfor which the notification information was generated. The added metadatain the first video content 112 may assist the user 110 to quickly referdifferent continuity mistakes (or inconsistent objects) while editing orrendering the first video content 112 at later stages.

In accordance with an embodiment, the circuitry 202 may be configured toreceive the one or more user inputs from the user 110 to select one ofthe plurality of UI options 402A to 402N, via the I/O device 210. Thecircuitry 202 may be further configured to control the display screen104 to display the corresponding shots or the corresponding image frameassociated with the selected one of the plurality of UI options 402A to402N. For example, in case the user 110 selects the third UI option402C, the circuitry 202 may control the display screen 104 to displaythe first image frame 122 and the second image frame 124 which includethe discontinuity (i.e. deviation in shape and text) of the third object308A (or similar ninth object 308B) with the object type as car.

In accordance with an embodiment, the circuitry 202 may be configured toreceive the one or more user inputs from the user 110, via the I/Odevice 210, to filter the displayed the plurality of UI options 402A to402N. The circuitry 202 may be configured to filter the plurality of UIoptions 402A to 402N based on different factors. Examples of the factorsmay include, but are not limited to, scene number, shot number, objecttype, or one or more visual attributes. For example, the circuitry 202may receive the one or more user inputs from the user 110 to filter anddisplay one or more of the plurality of UI options 402A to 402N (i.e.generation notification information) for scene 1. In another example,the circuitry 202 may receive the one or more user inputs from the user110 to filter and display one or more of the plurality of UI options402A to 402N which include deviation in position, or missing objects, orhuman as object type. Thus, the display of selective discontinuities ofthe first video content 112 may assist the user 110 to identify actualcontinuity mistakes which should be corrected to enhance the quality andpopularity of the first video content 112.

With respect to FIG. 4B, there is shown the generated notificationinformation in a graphical form on the either of the first image frame122 and the second image frame 124. In FIG. 4B, there is a shown aplurality of UI regions 404A to 404F as the notification information.The plurality of UI regions 404A to 404F may be a graphical overlay (forexample a transparent boundary or region) over the first set of objectsand the second set of objects in which the continuity errors have beendetected or the notification information is generated by the circuitry202 as described in detail, for example, in FIG. 3. Thus, such graphicalrepresentation of the notification information may assist the user 110(as the editor) to identify the continuity errors in lesser time. Basedon the displayed graphical representation of the notificationinformation associated with different objects, the user 110 may furthertake appropriate decision either to reshoot the detected shot (with thecontinuity errors), correct the continuity errors using the known videoediting techniques or just ignore the continuity error of a particularobject (in case the discontinuity or the deviation was intentional atthe time of capture of the shot).

FIG. 5 illustrates a second exemplary scenario for control ofnotifications for deviations in depiction of different objects in filmedshots of video content by the video editing device of FIG. 2, inaccordance with an embodiment of the disclosure. FIG. 5 is explained inconjunction with elements from FIGS. 1, 2, 3, 4A, and 4B. With referenceto FIG. 5, there is shown a second scenario 500.

In the second scenario 500, there is shown the first image frame 122 andthe second image frame 124 on which the continuity check may beperformed by the video editing device 102. The first image frame 122 mayinclude a first facial object 502A (with the object type as human face)and the second image frame 124 may include a second facial object 502B(similar to the first facial object 502A). With respect to FIG. 5, thefirst image frame 122 and the second image frame 124 may be captured bythe video capturing device at different angles with respect to the firstfacial object 502A. For example, the first image frame 122 in the firstfilmed shot 116A may be captured from a right-side (for example atzero-degree angle) of the first facial object 502A and the second imageframe 124 in the second filmed shot 116B may be captured from afront-side (for example at ninety-degree angle) of the second facialobject 502B.

In accordance with an embodiment, the first image frame 122 may includefirst angle information (for example zero-degree) which may indicate afirst angle at which the first image frame 122 may be captured by thevideo capturing device. In accordance with an embodiment, the videocapturing device may be at a first position (for example at a right-sideof the first facial object 502A) to capture the first image frame 122.The second image frame 124 may include second angle information (forexample ninety-degree) which may indicate a second angle at which thesecond image frame 124 may be captured by the video capturing device. Inaccordance with an embodiment, the video capturing device may be at asecond position (for example at front-side of the second facial object502B) to capture the second image frame 124. The second angle (at whichthe second image frame 124 may be captured) may be defined with respectto the first angle (at which the first image frame 122 may be captured).In accordance with an embodiment, each of the sequence of image frames118 of the first filmed shot 116A may include the first angleinformation, and each of the sequence of image frames 120 of the secondfilmed shot 1168 may include the second angle information.

In accordance with an embodiment, the object detector 204 may beconfigured to detect the first facial object 502A from the first imageframe 122. The circuitry 202 may be configured to receive the detectedfirst facial object 502A from the object detector 204. In accordancewith an embodiment, the circuitry 202 may be further configured togenerate a first three-dimensional (3D) image of the detected firstfacial object 502A based on the first angle information included in thefirst image frame 122. In some embodiments, the circuitry 202 maygenerate the first 3D image of the first facial object 502A usingdifferent 2D-3D conversion techniques. Example of such 2D-3D conversiontechniques may include, but are not limited to, 3D reconstruction frommultiple images using discrete linear transform (DLT), 3D reconstructionfrom 2D images using deep neural networks, 3D Reconstruction from 2Dimages using triangulation, or 3D reconstruction from monocular andstereo cues in 2D images.

In accordance with an embodiment, the object detector 204 may be furtherconfigured to detect the second facial object 502B from the second imageframe 122. The circuitry 202 may be configured to receive the detectedsecond facial object 502B from the object detector 204. In accordancewith an embodiment, the circuitry 202 may be further configured togenerate a second 3D image of the detected second facial object 502Bbased on the second angle information included in the second image frame124.

In accordance with an embodiment, each of the sequence of image frames118 may include the first angle information which indicate the firstangle at which each of the sequence of image frames 118 are captured bythe video capturing device. In some embodiments, a first plurality ofimage frames of the sequence of image frames 118 may be captured at adifferent angle in comparison to a second plurality of image frames ofthe sequence of image frames 118. For example, in case, the videocapturing device is continuously moving at a particular direction(either in X-axis, Y-axis, or Z-axis) and with a particular speed, thenthe first plurality of image frames of the first filmed shot 116A may becaptured at different angles with respect to the capture of the secondplurality of image frames of the first filmed shot 116A. In accordancewith an embodiment, a number of first plurality of image frames and anumber of the second plurality of image frames captured at differentangles may be based on the frame (or capture) rate of the videocapturing device and the particular speed at which the video capturingdevice moves while capture of the first filmed shot 116A.

In accordance with an embodiment, each of the sequence of image frames118 may include first motion information of the video capturing device.The first motion information may indicate a change in position of thevideo capturing device with respect to a first original position (forexample a position of the video capturing device while capture of afirst image frame of the sequence of image frames 118). The position maycorrespond to XYZ position of the video capturing device in a 3Dphysical space.

In accordance with an embodiment, the circuitry 202 may be configured togenerate the first 3D image of the detected first facial object 502A (orthe first set of objects in FIG. 3) based on the first angle informationor the first motion information associated with each of the sequence ofimage frames 118. Similarly, each of the sequence of image frames 120 ofthe second filmed shot 1168 may include second motion information of thevideo capturing device. The second motion information may indicate thechange in the position of the video capturing device with respect to asecond original position (for example a position of the video capturingdevice while capture of a first image frame of the sequence of imageframes 120 of the second filmed shot 116B). In accordance with anembodiment, the circuitry 202 may be further configured to generate thesecond 3D image of the detected second facial object 502B (or the secondset of objects in FIG. 3) based on the second angle information or thesecond motion information associated with each of the sequence of imageframes 120 of the second filmed shot 1168.

In accordance with an embodiment, the circuitry 202 may be furtherconfigured to compare the generated first 3D image of the first facialobject 502A (i.e. captured at the first angle) with the generated second3D image of the second facial object 502B (i.e. captured at the secondangle). The circuitry 202 may be further configured to detect adeviation in the one or more visual attributes of the generated first 3Dimage of the first facial object 502A and the generated second 3D imageof the second facial object 502B. With respect to FIG. 5, the circuitry202 may be configured to detect the deviation in texture or shape (forexample texture or shape of blood) of the generated first 3D image ofthe first facial object 502A from the generated second 3D image of thesecond facial object 502B. With respect to FIG. 5, the texture or theshape of a first sub-object 504A (for example with the object type asblood mark) on the first facial object 502A is different from thetexture or the shape of a second sub-object 504B (similar to the firstsub-object 504A) on the second facial object 502B. Further, a thirdsub-object 506 (for example with the object type as blood mark) on thesecond facial object 502B in the second image frame 124 may be missingon the first facial object 502A in the first image frame 122.

In accordance with an embodiment, the circuitry 202 may be furtherconfigured to generate the notification information based on thedeviation of the texture (or shape) of the first sub-object 504A fromthe second sub-object 504B. In some embodiments, the circuitry 202 maybe configured to generate the notification information based on thedetection of discontinuity (or absence) of the third sub-object 506between the consecutive frames (i.e. the first image frame 122 and thesecond image frame 124). Thus, the 3D generation and comparisoncapability of the video editing device 102 provides automatic detectionof discontinuities of objects even-though the objects were captured atdifferent angles in the consecutive shots.

FIG. 6 illustrates a third exemplary scenario for control ofnotifications for factual errors in depiction of different objects infilmed shots of video content by the video editing device of FIG. 2, inaccordance with an embodiment of the disclosure. FIG. 6 is explained inconjunction with elements from FIGS. 1, 2, 3, 4A, 4B, and 5. Withreference to FIG. 6, there is shown a third scenario 600.

In the third scenario 600, there is shown a captured image frame 602 ofa particular filmed shot of the first video content 112. The image frame602 may include a first object 604 and a second object 606. Inaccordance with an embodiment, the first object 604 may correspond to anobject type as a human (for example a news reporter) and the secondobject 606 may correspond to an object type as a handled microphone heldby the first object 604. In FIG. 6, there is also shown text informationprinted on the second object 606. For example, the text information maycorrespond a name of organization (such as a news channel name) to whichthe first object 604 (for example the news reporter) may be associated.

In accordance with an embodiment, the circuitry 202 may be configured toreceive a second user input from the user 110, via the I/O device 210,during the editing phase of the first video content 112. The second userinput may include a request to perform a factual error check on thefirst video content 112. In response to the receipt of the second userinput, the circuitry 202 may be configured to execute the factual errorcheck on each image frame (for example the image frame 602) or eachfilmed shot of the first video content 112.

With respect to FIG. 6, the circuitry 202 may be configured to extractthe image frame 602 from the first video content 112 stored in thememory 206. The object detector 204 may be configured to detect theobject type of each of the first object 604 and the second object 606from the extracted image frame 602. The object detector 204 may befurther configured to detect the text information (for example “TopChannel” as shown in FIG. 6) associated with the second object 606 fromthe extracted image frame 602. In accordance with an embodiment, thecircuitry 202 may be configured to receive the detected object type ofeach of the first object 604 and the second object 606, and the textinformation from the object detector 204. The circuitry 202 may befurther configured to retrieve the first timeline information associatedwith the first video content. The first timeline information mayindicate the era related to the first video content 112 or the era (forexample a medieval era) depicted by the first video content 112. Inaccordance with an embodiment, the first timeline information mayindicate a year or a decade based on which the first video content 112may be captured. In some embodiments, the circuitry 202 may beconfigured to receive the first timeline information from the user 110,via the I/O device 210. The first timeline information may be includedin the second user input received from the user 110.

In accordance with an embodiment, the circuitry 202 may be configured tosend an image and the object type of each of the detected first object604 and the second object 606, and the detected text information to theserver 106. In accordance with an embodiment, the server may include amaster database (not shown) which may store a plurality of objects withdifferent object types. In some embodiments, the master database mayinclude different versions of the plurality of objects. Differentversions may indicate different changes (for example visual features)with respect to different objects. For example, the master database maystore different object types for different television or mobile phoneversions. In accordance with an embodiment, the master database mayinclude the second timeline information associated with the plurality ofobjects, the object types or different versions. The second timelineinformation may indicate timing information of invention, discovery,use, market release, or evolution of the each of the plurality ofobjects, the object types or different versions. For example, the secondtimeline information of a mobile phone (or specific version or model)may indicate a year when the mobile phone was invented, first releasedin market, or used, or evolved. With respect to FIG. 6, the masterdatabase in the server 106 may store the second timeline informationassociated with the second object 606 (for example the handheldmicrophone) or associated with detected text information (for example“Top Channel”).

In accordance with an embodiment, the server 106 may be configured toreceive the image and the object type of each of the detected firstobject 604 and the second object 606, and the detected text informationfrom the circuitry 202. The server 106 may be further configured tocompare the detected first object 604, the second object 606, and thedetected text information with the stored plurality of objects (ordifferent versions) based on the object type of each of the detectedfirst object 604 and the second object 606 and the stored object type ofeach of the stored plurality of objects. The server 106 may be furtherconfigured to identify an object from the stored plurality of objectsbased on the comparison where the object type of the identified objectis similar to the object type of the received first object 604 (or thesecond object 606 with the associated text information). The server 106may be further configured to retrieve the second timeline information ofthe identified object from the master database.

In accordance with an embodiment, the circuitry 202 may be furtherconfigured to receive the second timeline information of the identifiedobject from the server 106. In some embodiments, the memory 206 may beconfigured to store images of the plurality of objects (or differentversions), the associated object types and the associated secondtimeline information. The circuitry 202 may be configured to identifythe object from the stored plurality of objects based on the comparisonof the detected first object 604, the second object 606, and thedetected text information with the stored plurality of objects (ordifferent versions) based on corresponding object types. The circuitry202 may be further configured to retrieve the second timelineinformation associated with the identified object.

In accordance with an embodiment, the circuitry 202 may be furtherconfigured to compare the first timeline information associated with thefirst video content 112 with the second timeline information associatedwith each of the detected first object 604 and the second object 606 inthe extracted image frame 602. The circuitry 202 may be furtherconfigured to determine a deviation in the first timeline informationfrom the second timeline information for each of the detected firstobject 604 and the second object 606. For example, in case the firsttimeline information indicates that the first video content 112 isrelated to the era (for example 1900), and the second timelineinformation of the second object 606 indicates that “handheldmicrophone” was used later (for example 1950), the circuitry 202 mayidentify the deviation between the first timeline information of thefirst video content 112 and the second timeline information of thesecond object 606 as the factual error in the image frame 602. Inanother example, in case, the second timeline information associatedwith the detected text information indicates that the “Top Channel” wasstarted in 1970, the circuitry 202 may identify the deviation betweenthe first timeline information of the first video content 112 from thesecond timeline information of the second object 606 as the factualerror.

In accordance with an embodiment, the circuitry 202 may be furtherconfigured to generate the notification information associated with thesecond object 606 for which the factual error was detected. Inaccordance with an embodiment, the generated notification informationmay include the identifier of a filmed scene which includes the imageframe 602, the identification information of the filmed shot whichincludes the image frame 602, and the details of the factual error ofthe second object 606. In accordance with an embodiment, the circuitry202 may be configured to display the generated notification informationon the display screen for the user 110 as shown in FIG. 4A. In someembodiments, the circuitry 202 may be configured to control the displayscreen 104 to display the graphical overlay (as shown in FIG. 4B) overthe second object 606 for which the factual error was detected.

In accordance with an embodiment, the circuitry 202 may be configured todetect the factual errors for each detected object included in eachimage frame of each filmed shot of the first video content 112. Thecircuitry 202 may be further configured to generate the notificationinformation for each object for which the circuitry 202 detects thefactual error. Based on the generated notification information about thefactual error, the user 110 (for example editor) may take decisioneither to reshoot the filmed shot by removal of the object for which thefactual error was detected or edit the object using the known imageprocessing techniques before the finalization of the first video content112 for the audiences. Thus, the detection of the factual errors by thevideo editing device 102 may enhance the overall quality of the firstvideo content and the credibility of the user 110 (as editor).

FIG. 7 illustrates a fourth exemplary scenario for control ofnotifications for duplicacy in depiction of different objects in filmedshots of video content by the video editing device of FIG. 2, inaccordance with an embodiment of the disclosure. FIG. 7 is explained inconjunction with elements from FIGS. 1, 2, 3, 4A, 4B, 5, and 6. Withreference to FIG. 7, there is shown a fourth scenario 700.

In the fourth scenario 700, there is shown a captured image frame 702 ofa particular filmed shot of the first video content 112. The image frame702 may include a first object 704, a second object 706, and a thirdobject 708. The first object 704 may correspond to the object type as asun, the second object 706 may correspond to the object type as a human(for example a celebrity), and the third object 708 may correspond tothe object type as a mountain. In accordance with an embodiment, thedetected object type of each of the first object 704, the second object706, and the third object 708 may be a first metadata. In someembodiments, the image frame 702 may include details associated with thedetected first object 704, the second object 706, and the third object708 as the first metadata. In some embodiments, the circuitry 202 may beconfigured to determine the details as the first metadata based on theanalysis of the detected objects. Examples of the details, may include,but are not limited to, a size of an object, a dimension of an object,an object type of an object, a geo-location at which the image frame 702was captured, a posture of an object, an orientation of an object, acolor of an object, a texture of an object, or a shape of an object. Insome embodiments, the first metadata may include artist information (forexample name of a celebrity or a person) in case the detected objecttype is human.

In accordance with an embodiment, the first metadata associated with thefilmed shot or the image frame 702 may be stored in the memory 206. Inaccordance with an embodiment, the circuitry 202 may be configured toreceive a third user input from the user 110, via the I/O device 210,during the editing phase of the first video content 112. The third userinput may include a request to perform a duplicacy check on the firstvideo content 112. The user 110 (for example the editor) may conduct theduplicacy check on the first video content 112 to confirm whether thecaptured filmed shots or the plurality of objects in the filmed shots ofthe first video content 112 are not included in any other video content(for example other movies) already released. Based on the duplicacycheck, the video editing device 102 may ensure that the captured filmedshots and the included set of objects (or combination) are newlyintroduced in the first video content 112 which may further enhance thecreditability and the popularity of the first video content 112

In accordance with an embodiment, in response to the receipt of thethird user input, the circuitry 202 may be configured to retrieve thestored first metadata associated with the image frame 702 (or the filmedscene which includes the image frame 702). In some embodiments, thecircuitry 202 may be configured to determine the first metadataassociated with the detected objects in the image frame 702. Inaccordance with an embodiment, the circuitry 202 may be furtherconfigured to retrieve a second metadata from the server 106. The secondmetadata may be associated with a plurality of filmed shots of secondvideo content (i.e. different from the first video content 112). Forexample, the second video content may correspond to previous or oldermovies already been released to viewers. In some embodiments, the secondvideo content may correspond a content or movie production organization(for example a movie banner) which may be responsible for capture of thefirst video content 112.

The second metadata may include information related to the plurality offilmed shots of the second video content. The information related to thesecond video content may include, but are not limited to, informationabout objects in the plurality of filmed shots, artist information ofthe plurality of filmed shots, or geo-location information at which theplurality of filmed shots are captured. In accordance with anembodiment, the circuitry 202 may be configured to retrieve the secondmetadata for each of the plurality of filmed shots of the second videocontent. The circuitry 202 may be further configured to compare thefirst metadata (i.e. associated with the filmed shot which includes theimage frame 702) with the second metadata for each of the plurality offilmed shots of the second video content. In case, the first metadatamatches with the second metadata for one of the plurality of filmedshots of the second video content, the circuitry 202 may determine thatthe captured filmed shot (or the first object 704, the second object706, the third object 708, or the combination) in the first videocontent 112 may be present or already been captured/used in the secondvideo content (for example older movies). This indicates the duplicacyof the filmed shot (or the first object 704, the second object 706, thethird object 708, or the combination) between the first video content112 and the second video content.

In accordance with an embodiment, the circuitry 202 may be furtherconfigured to generate the notification information for the user basedon identification of the duplicacy. In some embodiments, the generatednotification information may indicate the error message that mayindicate a reproduction of a filmed scene or the filmed shot of thefirst video content 112 in the second video content. The circuitry 202may be further configured to control the display screen 104 to displaythe notification information generated based on the duplicacy check. Inaccordance with an embodiment, the notification information may includethe identifier of the filmed scene, the identification information ofthe filmed shot, and the details of the objects which were foundduplicate in comparison with the second video content.

In accordance with an embodiment, the circuitry 202 may be configured toconduct the duplicacy check for each of the plurality of filmed shots ofthe first video content 112. In some embodiments, the circuitry 202 maybe configured to retrieve the second metadata associated with aplurality of videos or movies stored in the server 106 to conduct theduplicacy check. Thus, with the duplicacy check capability, the videoediting device 102 may provide automatic alerts to the user 110 aboutthe duplicate shots (or combination of same objects in the shot). Basedon the automatic alerts, the user 110 may further take appropriateactions (i.e. either to reshoot the shot or edit the duplicate objects)before finalization of the first video content 112. This furtherprovides novelty to the first video content 112 and preventsincorporation of similar scene or shot from the previous video content.Thus, the popularity of the first video content 112 among the audiencesmay be enhanced.

FIGS. 8A and 8B, collectively, depict a flowchart that illustratesexemplary operations for control of notifications for deviations indepiction of different objects in filmed shots of video content, inaccordance with an embodiment of the disclosure. FIGS. 8A and 8B areexplained in conjunction with elements from FIGS. 1, 2, 3, 4A, 4B, 5, 6,and 7. With reference to FIGS. 8A and 8B, there is shown a flowchart800. The operations from 802 to 848 may be implemented on the videoediting device 102. The operations starts at 802 and proceeds to 804.

At 804, first video content 112 that may include a set of filmed scenes114A to 114B of a filmed sequence may be stored, each of the set offilmed scenes 114A to 114B may include a plurality of filmed shots andeach of the plurality of filmed shots may include a sequence of imageframes. In accordance with an embodiment, the memory 206 may beconfigured to store the first video content 112 that may include the setof filmed scenes 114A to 114B of the filmed sequence. In accordance withan embodiment, the video editing device 102 may be configured toretrieve the first video content 112 from the memory 206. In someembodiments, the video editing device 102 may be configured to retrievethe first video content 112 from the server 106.

At 806, a first image frame 122 of a first filmed shot 116A of theplurality of filmed shots and a second image frame 124 of a secondfilmed shot 116B of the plurality of filmed shots may be extracted. Inaccordance with an embodiment, the circuitry 202 may be configured toextract the first image frame 122 of the first filmed shot 116A of theplurality of filmed shots and the second image frame 124 of the secondfilmed shot 116B of the plurality of filmed shots. The first image frame122 may be a last frame of the first filmed shot 116A and the secondimage frame may be a first frame of the second filmed shot 116B.

At 808, a first set of objects may be detected from the extracted firstimage frame 122 of the first filmed shot 116A and a second set ofobjects may be detected from the extracted second image frame 124 of thesecond filmed shot 1168. The object detector 204 may be configured todetect the first set of objects from the extracted first image frame 122of the first filmed shot 116A and the second set of objects from theextracted second image frame 124 of the second filmed shot 1168.

At 810, a first object from each of the first set of objects and thesecond set of objects may be identified based on an object type of thefirst object, where the object type of the first object in the first setof objects and the second set of objects is same. In accordance with anembodiment, the object detector 204 may be configured to identify thefirst object from each of the first set of objects and the second set ofobjects based on the object type of the first object.

At 812, first angle information from the first image frame 122 andsecond angle information from the second image frame 124 may beretrieved. In accordance with an embodiment, the circuitry 202 may beconfigured to retrieve the first angle information from the first imageframe 122 and the second angle information from the second image frame124.

At 814, a first three-dimensional (3D) image of the first object in thefirst image frame 122 may be generated based on the retrieved firstangle information. In accordance with an embodiment, the circuitry 202may be configured to generate the first 3D image of the first object inthe first image frame 122 based on the retrieved first angleinformation. The generation of the first 3D image of the first object inthe first image frame 122 may be described in detail, for example, inFIG. 5.

At 816, a second 3D image of the first object in the second image frame124 may be generated based on the retrieved second angle information. Inaccordance with an embodiment, the circuitry 202 may be configured togenerate the second 3D image of the first object in the second imageframe 124 based on the retrieved second angle information. Thegeneration of the second 3D image of the first object in the secondimage frame 124 may be described in detail, for example, in FIG. 5.

At 818, a deviation of a first visual attribute of the generated first3D image of the first object from a second visual attribute of thegenerated second 3D image of the first object may be detected. Inaccordance with an embodiment, the video editing device 102 may beconfigured to detect the deviation of the first visual attribute of thegenerated first 3D image of the first object from the second visualattribute of the generated second 3D image of the first object. Thefirst visual attribute and the second visual attribute may be of sametype. Examples of the first visual attribute and the second visualattribute may include, but are not limited to, a shape, a size, adimension, a color, a brightness value, a contrast value, a texture, anorientation, an angle of tilt, a posture, or textual information.

At 820, the notification information associated with the first objectmay be generated based on the detected deviation. The circuitry 202 maybe configured to generate the notification information associated withthe first object based on the detected deviation.

At 822, the display screen 104 may be controlled to output the generatednotification information associated with the first object in the firstimage frame 122 and the second image frame 124. The circuitry 202 may beconfigured to control the display screen 104 to output the generatednotification information associated with the first object in the firstimage frame 122 and the second image frame 124.

At 824, it may be detected whether an object in the detected first setof objects is missing in the detected second set of objects. Thecircuitry 202 may be configured to detect whether the object in thedetected first set of objects is missing in the detected second set ofobjects.

At 826, the notification information associated with the object missingin the detected second set of objects may be generated and output. Thecircuitry 202 may be configured to generate and output the notificationinformation associated with the object missing in the detected secondset of objects.

At 828, the first timeline information which indicates the era relatedto the first video content 112 may be retrieved. The circuitry 202 maybe configured to retrieve the first timeline information which indicatesthe era related to the first video content 112. The first timelineinformation may be described in detail, for example, in FIG. 6. In someembodiments, the circuitry 202 may be configured to receive the firsttimeline information from the user 110, via the I/O device 210. Thefirst timeline information may be included in the second user inputreceived from the user 110.

At 830, a second object may be identified from a plurality of objects,where the object type of the first object and the second object aresame. The circuitry 202 may be configured to identify the second objectfrom the plurality of objects stored in the server 106. Theidentification of the second object from the plurality of objects storedin the server 106 may be described, in detail, for example, in FIG. 6.

At 832, the second timeline information of the identified second objectmay be retrieved. The circuitry 202 may be configured to retrieve thesecond timeline information of the identified second object from theserver 106 as described in detail, for example, in FIG. 6.

At 834, the retrieved first timeline information of the first videocontent 112 and the retrieved second timeline information of theidentified second object may be compared. The circuitry 202 may beconfigured to compare the retrieved first timeline information of thefirst video content 112 and the retrieved second timeline information ofthe identified second object as described in detail, for example, inFIG. 6.

At 836, the notification information associated with the first objectmay be generated and output based on the comparison. The circuitry 202may be configured to generate and output the notification informationassociated with the first object based on the comparison as described indetail, for example, in FIG. 6.

At 838, the first metadata associated with each of the plurality offilmed shots of the first video content 112 may be retrieved. Thecircuitry 202 may be configured to retrieve the first metadataassociated with each of the plurality of filmed shots of the first videocontent 112. In some embodiments, the circuitry 202 may be configured todetermine details associated with the detected objects, as the firstmetadata, based on the analysis of the detected objects. Examples of thedetails, may include, but are not limited to, a size of an object, adimension of an object, an object type of an object, a geo-location atwhich the image frame 702 was captured, a posture of an object, anorientation of an object, a color of an object, a texture of an object,or a shape of an object. In some embodiments, the first metadata mayinclude artist information (for example name of a celebrity or a person)in case the detected object type is human. The first metadata associatedwith each of the plurality of filmed shots of the first video content112 may be described in detail, for example, in FIG. 7.

At 840, the second metadata associated with each of plurality of filmedshots of second video content may be retrieved. The circuitry 202 may beconfigured to retrieve the second metadata associated with each ofplurality of filmed shots of second video content as described indetail, for example, in FIG. 7. The second metadata may includeinformation related to the plurality of filmed shots of the second videocontent. The information related to the second video content mayinclude, but are not limited to, information about objects in theplurality of filmed shots, artist information of the plurality of filmedshots, or geo-location information at which the plurality of filmedshots are captured.

At 842, the first metadata associated with the first video content 112may be compared with the second metadata associated with second videocontent. The circuitry 202 may be configured to compare the firstmetadata associated with the first video content 112 with the secondmetadata associated with the second video content.

At 844, the notification information may be generated based on thecomparison. The circuitry 202 may be configured to generate thenotification information based on the comparison between the firstmetadata associated with the first video content 112 and the secondmetadata associated with the second video content.

At 846, the display screen 104 may be controlled to output the generatednotification information associated with the first metadata of the firstvideo content 112. The circuitry 202 may be configured to control thedisplay screen 104 o output the generated notification informationassociated with the first metadata of the first video content 112.Control passes to end 848.

Exemplary aspects of the disclosure may include a video editing device(such as the video editing device 102 of FIG. 1) that includes a memory(such as the memory 206 of FIG. 2) configured to store first videocontent (such as the first video content 112 of FIG. 1). The first videocontent 112 may include a set of filmed scenes (such as the set offilmed scenes 114A to 114B of FIG. 1) of a filmed sequence. Each of theset of filmed scenes 114A to 1148 may include a plurality of filmedshots, and each of the plurality of filmed shots may include a sequenceof image frames (such as the sequence of image frames 118 and thesequence of image frames 120 of FIG. 1). The video editing device 102may further include circuitry (such as the circuitry 202 of FIG. 2)configured to extract a first image frame (such as the first image frame122 of FIG. 1) of a first filmed shot (such as the first filmed shot116A of FIG. 1) of the plurality of filmed shots. The circuitry 202 maybe further configured to extract a second image frame (such as thesecond image frame 124 of FIG. 1) of a second filmed shot (such as thesecond filmed shot 1168 of FIG. 1) of the plurality of shots.

In accordance with an embodiment, the circuitry 202 may be configured todetect a first set of objects from the extracted first image frame 122of the first filmed shot 116A and a second set of objects from theextracted second image frame 124 of the second filmed shot 1168. Thecircuitry 202 may be further configured to identify a first object fromeach of the first set of objects and the second set of objects based onan object type of the first object. The object type of the first objectin the first set of objects and the second set of objects is same. Thecircuitry 202 may be further configured to generate notificationinformation associated with the first object based on a deviation of atleast a first visual attribute of the first object in the first imageframe 122 from at least a second visual attribute of the first object inthe second image frame 124. The first visual attribute and the secondvisual attribute may be of same type The circuitry 202 may be furtherconfigured to control a display screen (such as the display screen 104of FIG. 1) to output the generated notification information associatedwith the first object in the first image frame 122 and the second imageframe 124.

In accordance with an embodiment, the first filmed shot 116A and thesecond filmed shot 1168 may be consecutive shots of a first filmed scene(such as the first filmed scene 114A of FIG. 1) of the set of filmedscenes of the first video content 112. The first image frame 122 may bea last frame among the sequence of image frames of the first filmed shot116A and the second image frame 124 may be a first frame among thesequence of image frames of the second filmed shot 1168. In accordancewith an embodiment, the first filmed shot 116A and the second filmedshot 1168 correspond to the first filmed scene 114A of the set of filmedscenes of the first video content 112.

In accordance with an embodiment, the first filmed shot 116A and thesecond filmed shot 1168 may correspond to different filmed scenes of theset of filmed scenes of the first video content 112, where the differentfilmed scenes may correspond to same context.

In accordance with an embodiment, the at least first visual attributeand the at least second visual attribute may include at least one of ashape, a size, a dimension, a color, a brightness value, a contrastvalue, a texture, an orientation, an angle of tilt, a posture, ortextual information of the first object.

In accordance with an embodiment, the first filmed shot 116A may includefirst angle information and the second filmed shot 116B may includesecond angle information. The circuitry 202 may be further configured togenerate a first three-dimensional (3D) image of the first object in theextracted first image frame 122 of the first filmed shot 116A based onthe first angle information. The circuitry 202 may be further configuredto generate a second 3D image of the first object in the extractedsecond image frame 124 of the second filmed shot 1168 based on thesecond angle information. The circuitry 202 may be further configured togenerate the notification information based on a difference in thedetermined first 3D image and the determined second 3D image.

In accordance with an embodiment, the first video content 112 mayinclude first timeline information which indicates an era related to thefirst video content 112. The circuitry 202 may be further configured tocompare the first object in the extracted first image frame 122 with aplurality of objects in a master database on a server (such as theserver 106) based on the object type of the first object and an objecttype of the plurality of objects in the master database. The circuitry202 may be further configured to retrieve second timeline informationassociated with an object of the plurality of objects in the masterdatabase based on the comparison. The second timeline information mayindicate timing information of invention, discovery, use, marketrelease, or evolution of the object. The circuitry 202 may be furtherconfigured to compare the first timeline information with the retrievedsecond timeline information and generate the notification informationassociated with the first object based on the comparison of the firsttimeline information with the retrieved second timeline information.

In accordance with an embodiment, the notification information mayinclude a first error message that may indicate a production error indepiction of the first object in the extracted second image frame 124.

In accordance with an embodiment, the circuitry 202 may be furtherconfigured to determine a first position of the first object in thefirst image frame 122 and a second position of the first object in thesecond image frame 124. The circuitry 202 may be further configured togenerate the notification information based on a position deviation ofthe first position from the second position, where the positiondeviation is based on context of the first filmed scene 114A that mayinclude the first object.

In accordance with an embodiment, the circuitry 202 may be furtherconfigured to determine whether the first object in the detected firstset of objects is absent in the detected second set of objects of theextracted second image frame 124. The circuitry 202 may be furtherconfigured to generate the notification information, as a second errormessage, based on the determination that the first object is absent inthe detected second set of objects of the extracted second image frame124.

In accordance with an embodiment, the notification information mayinclude at least one of the object type related to the first object,identification information the first filmed shot, or an identifier ofthe first filmed scene 114A that may include the first object. Inaccordance with an embodiment, the circuitry 202 may be furtherconfigured to add the notification information as metadata in the firstvideo content 112.

In accordance with an embodiment, the circuitry 202 may be furtherconfigured to determine a first metadata associated with the detectedfirst set of objects and the second set of objects. The first metadatamay include information that may indicate a geo-location at which eachof the plurality of filmed shots is captured, artist information, orinformation about the detected first set of objects, and the detectedsecond set of objects. In accordance with an embodiment, the circuitry202 may be further configured to retrieve a second metadata associatedwith a plurality of filmed shots related to second video content fromthe server 106. The circuitry 202 may be further configured to comparethe first metadata of the first video content 112 with the secondmetadata of the second video content. The circuitry 202 may be furtherconfigured to generate the notification information based on thecomparison. The notification information may include a third errormessage that may indicate a reproduction of a filmed scene or a filmedshot of the first video content 112 in the second video content.

Various embodiments of the disclosure may provide a non-transitory,computer readable medium and/or storage medium, and/or a non-transitorymachine readable medium and/or storage medium stored thereon, a machinecode and/or a computer program with at least one code section executableby a machine and/or a computer for continuity check in video content.The at least one code section may cause the machine and/or computer toperform the steps that comprise storage of first video content thatcomprises a set of filmed scenes of a filmed sequence; where each of theset of filmed scenes comprises a plurality of filmed shots, and whereineach of the plurality of filmed shots comprises a sequence of imageframes. Further, a first image frame of a first filmed shot of theplurality of filmed shots and a second image frame of a second filmedshot of the plurality of filmed shots may be extracted. A first set ofobjects from the extracted first image frame and a second set of objectsfrom the extracted second image frame may be detected. A first objectfrom each of the first set of objects and the second set of objectsbased on an object type of the first object may be identified. Theobject type of the first object in the first set of objects and thesecond set of objects may be same. Notification information associatedwith the first object may be further generated based on a deviation ofat least a first visual attribute of the first object in the first imageframe from at least a second visual attribute of the first object in thesecond image frame, wherein the first visual attribute and the secondvisual attribute are of same type. A display screen may be furthercontrolled to output the generated notification information associatedwith the first object in the first image frame and the second imageframe.

The present disclosure may be realized in hardware, or a combination ofhardware and software. The present disclosure may be realized in acentralized fashion, in at least one computer system, or in adistributed fashion, where different elements may be spread acrossseveral interconnected computer systems. A computer system or otherapparatus adapted to carry out the methods described herein may besuited. A combination of hardware and software may be a general-purposecomputer system with a computer program that, when loaded and executed,may control the computer system such that it carries out the methodsdescribed herein. The present disclosure may be realized in hardwarethat comprises a portion of an integrated circuit that also performsother functions.

The present disclosure may also be embedded in a computer programproduct, which comprises all the features that enable the implementationof the methods described herein, and which when loaded in a computersystem is able to carry out these methods. Computer program, in thepresent context, means any expression, in any language, code ornotation, of a set of instructions intended to cause a system withinformation processing capability to perform a particular functioneither directly, or after either or both of the following: a) conversionto another language, code or notation; b) reproduction in a differentmaterial form.

While the present disclosure is described with reference to certainembodiments, it will be understood by those skilled in the art thatvarious changes may be made and equivalents may be substituted withoutdeparture from the scope of the present disclosure. In addition, manymodifications may be made to adapt a particular situation or material tothe teachings of the present disclosure without departure from itsscope. Therefore, it is intended that the present disclosure not belimited to the particular embodiment disclosed, but that the presentdisclosure will include all embodiments that fall within the scope ofthe appended claims.

1. A video editing device, comprising: a memory configured to storefirst video content that comprises a set of filmed scenes of a filmedsequence, wherein each of the set of filmed scenes comprises a pluralityof filmed shots, and each of the plurality of filmed shots comprises asequence of image frames; and circuitry configured to: extract a firstimage frame of a first filmed shot of the plurality of filmed shots anda second image frame of a second filmed shot of the plurality of filmedshots; detect a first set of objects from the extracted first imageframe and a second set of objects from the extracted second image frame;identify a first object from each of the first set of objects and thesecond set of objects based on an object type of the first object,wherein the object type of the first object in the first set of objectsand the second set of objects is same; generate notification informationassociated with the first object based on a deviation of at least afirst visual attribute of the first object in the extracted first imageframe from at least a second visual attribute of the first object in theextracted second image frame, wherein the first visual attribute and thesecond visual attribute are of same type; and control a display screento output the generated notification information associated with thefirst object in the extracted first image frame and the extracted secondimage frame, wherein the generated notification information comprises afirst error message that indicates a production error in depiction ofthe first object in the extracted second image frame.
 2. The videoediting device according to claim 1, wherein the first filmed shot andthe second filmed shot are consecutive shots of a first filmed scene ofthe set of filmed scenes of the first video content, and the first imageframe is a last frame among the sequence of image frames of the firstfilmed shot and the second image frame is a first frame among thesequence of image frames of the second filmed shot.
 3. The video editingdevice according to claim 1, wherein the first filmed shot and thesecond filmed shot correspond to a first filmed scene of the set offilmed scenes of the first video content.
 4. The video editing deviceaccording to claim 1, wherein the first filmed shot and the secondfilmed shot correspond to different filmed scenes of the set of filmedscenes of the first video content, and the different filmed scenescorrespond to same context.
 5. The video editing device according toclaim 1, wherein the at least first visual attribute and the at leastsecond visual attribute comprise at least one of a shape, a size, adimension, a color, a brightness value, a contrast value, a texture, anorientation, an angle of tilt, a posture, or textual information of thefirst object.
 6. The video editing device according to claim 1, whereinthe first filmed shot includes first angle information and the secondfilmed shot includes second angle information, the first angleinformation indicates a first angle at which the first filmed shot iscaptured, and the second angle information indicates a second angle atwhich the second filmed shot is captured.
 7. The video editing deviceaccording to claim 6, wherein the circuitry is further configured to:generate a first three-dimensional (3D) image of the first object in theextracted first image frame based on the first angle information;generate a second 3D image of the first object in the extracted secondimage frame based on the second angle information; and generate thenotification information based on a difference between the first 3Dimage and the second 3D image.
 8. The video editing device according toclaim 1, wherein the first video content includes first timelineinformation which indicates an era associated with the first videocontent.
 9. The video editing device according to claim 8, wherein thecircuitry is further configured to: compare the first object in theextracted first image frame with a plurality of objects in a masterdatabase on a server based on the object type of the first object and anobject type of the plurality of objects in the master database; retrievesecond timeline information associated with an object of the pluralityof objects in the master database based on the comparison between thefirst object and the plurality of objects, wherein the second timelineinformation indicates timing information of at least one of invention,discovery, use, market release, or evolution of the object; compare thefirst timeline information with the retrieved second timelineinformation; and generate the notification information associated withthe first object based on the comparison of the first timelineinformation with the retrieved second timeline information. 10.(canceled)
 11. The video editing device according to claim 1, whereinthe circuitry is further configured to: determine a first position ofthe first object in the first image frame and a second position of thefirst object in the second image frame; and generate the notificationinformation based on a position deviation of the first position from thesecond position, wherein the position deviation is based on context of afirst filmed scene that comprises the first object.
 12. The videoediting device according to claim 1, wherein the circuitry is furtherconfigured to: determine whether the first object in the detected firstset of objects is absent in the detected second set of objects of theextracted second image frame; and generate the notification information,as a second error message, based on the determination that the firstobject is absent in the detected second set of objects of the extractedsecond image frame.
 13. The video editing device according to claim 1,wherein the notification information further comprises at least one ofthe object type related to the first object, identification informationthe first filmed shot, or an identifier of a first filmed scene thatcomprises the first object, and the circuitry is further configured toadd the notification information as metadata in the first video content.14. The video editing device according to claim 1, wherein the circuitryis further configured to: determine a first metadata associated with thedetected first set of objects and the detected second set of objects,wherein the first metadata comprises information that indicates at leastone of a geo-location at which each of the plurality of filmed shots iscaptured, artist information, or information about the detected firstset of objects and the detected second set of objects.
 15. The videoediting device according to claim 14, wherein the circuitry is furtherconfigured to: retrieve a second metadata associated with the pluralityof filmed shots associated with second video content from a server;compare the first metadata of the first video content with the secondmetadata of the second video content; and generate the notificationinformation based on the comparison between the first metadata and thesecond metadata, wherein the notification information further comprisesa second error message that indicates a reproduction of a filmed sceneor a filmed shot of the first video content in the second video content.16. A method, comprising: in a video editing device: storing first videocontent that comprises a set of filmed scenes of a filmed sequence;wherein each of the set of filmed scenes comprises a plurality of filmedshots, and each of the plurality of filmed shots comprises a sequence ofimage frames; extracting a first image frame of a first filmed shot ofthe plurality of filmed shots and a second image frame of a secondfilmed shot of the plurality of filmed shots; detecting a first set ofobjects from the extracted first image frame and a second set of objectsfrom the extracted second image frame; identifying a first object fromeach of the first set of objects and the second set of objects based onan object type of the first object, wherein the object type of the firstobject in the first set of objects and the second set of objects issame; generating notification information associated with the firstobject based on a deviation of at least a first visual attribute of thefirst object in the extracted first image frame from at least a secondvisual attribute of the first object in the extracted second imageframe, wherein the first visual attribute and the second visualattribute are of same type; and controlling a display screen to outputthe generated notification information associated with the first objectin the extracted first image frame and the extracted second image frame,wherein the generated notification information comprises a first errormessage that indicates a production error in depiction of the firstobject in the extracted second image frame.
 17. The method according toclaim 16, wherein the first filmed shot and the second filmed shot areconsecutive shots of a first filmed scene of the set of filmed scenes ofthe first video content, and the first image frame is a last frame amongthe sequence of image frames of the first filmed shot and the secondimage frame is a first frame among the sequence of image frames of thesecond filmed shot.
 18. The method according to claim 16, wherein thefirst filmed shot and the second filmed shot correspond to a firstfilmed scene of the set of filmed scenes of the first video content. 19.The method according to claim 16, wherein the first filmed shot and thesecond filmed shot correspond to different filmed scenes of the set offilmed scenes of the first video content, and the different filmedscenes correspond to same context.
 20. The method according to claim 16,wherein the at least first visual attribute and the at least secondvisual attribute comprise at least one of a shape, a size, a dimension,a color, a brightness value, a contrast value, a texture, anorientation, an angle of tilt, a posture, or textual information of thefirst object.
 21. A video editing device, comprising: a memoryconfigured to store first video content that comprises a set of filmedscenes of a filmed sequence, wherein each of the set of filmed scenescomprises a plurality of filmed shots, and each of the plurality offilmed shots comprises a sequence of image frames; and circuitryconfigured to: extract a first image frame of a first filmed shot of theplurality of filmed shots and a second image frame of a second filmedshot of the plurality of filmed shots; detect a first set of objectsfrom the extracted first image frame and a second set of objects fromthe extracted second image frame; identify a first object from each ofthe first set of objects and the second set of objects based on anobject type of the first object, wherein the object type of the firstobject in the first set of objects and the second set of objects issame; determine whether the first object in the detected first set ofobjects is absent in the detected second set of objects of the extractedsecond image frame; generate notification information, as an errormessage, associated with the first object based on a deviation of atleast a first visual attribute of the first object in the first imageframe from at least a second visual attribute of the first object in thesecond image frame and based on the determination that the first objectis absent in the detected second set of objects, wherein the firstvisual attribute and the second visual attribute are of same type; andcontrol a display screen to output the generated notificationinformation associated with the first object in the first image frameand the second image frame.